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Abstract 

Strong embeddings, that is, couplings between a partial sum process of a sequence of random 
variables and a Brownian motion, have found numerous applications in probability and statistics. 
We extend Chatterjee’s novel use of Stein’s method for {—1,+1} valued variables to a general 
class of discrete distributions, and provide logo rates for the coupling of partial sums of inde¬ 
pendent variables to a Brownian motion, and results for coupling sums of suitably standardized 
exchangeable variables to a Brownian bridge. 


1 Introduction 

Let £j, £2 • ■ • be a sequence of independent random variables distributed as £, a mean zero, variance 
one random variable. Letting Sk = L-_i £,-,&= 1,2,..., be the corresponding sequence of partial sums, 
Donsker’s invariance principle fTO . see also (3j, implies that the random continuous function 

X n (t) = + ( nf _ M) £ N+i)> 0 < t < 1 

V n 

converges weakly to a Brownian motion process (B t )o<t<i- One way to study the quality of the 
approximation of X n (t) by B t is to determine a ‘slowly increasing’ sequence f(n) such that there 
exists an embedding of both processes on a common probability space such that 

max | S k -B k \ = O p (f(n)). 

0<k<n 

Finding the smallest achievable order of f(n) has been a very important question in the literature. 

The rate (n log log /z J 1 / 4 (log /i) 0 2 wa s achieved by Skorokhod l l23l l. also see its translation lf24l and 
Strassen j27l assuming E£ 4 < °o using Skorokhod embedding, and Kiefer lfl6l showed that this rate 
was optimal under the finite fourth moment condition. Csorgo and Revesz (8j made improvements to 
the rate under additional moment assumptions. See the survey paper by Obloj l20l and j9| for a more 
detailed account. 

The celebrated KMT approximation by Komlos, Major and Tusnady ( iflTft . lfl8l ) achieved the rate 
log n under the condition that £ have a finite moment generating function in a neighborhood of zero. 
To state their result precisely we make the following definition. 

We say Strong Embedding (SE) holds for the mean zero, variance one random variable £ if there 
exist constants C,K, and X such that for all /7 = 1,2,... the partial sums S& = £* =1 £,■./:= 1,... ,n of a 
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sequence £ 1 , £ 2 ... of independent random variables distributed as £, and a standard Brownian motion 
(B t ) t >0 can be constructed on a joint probability space such that 



for all x>0. 


P max 

V 0<k<n 


( 1 ) 


We adopt the standard empty sum convention whereby So = 0. 

Theorem 1.1 (KMT approximation lfl7l ). SE holds for £ satisfying Eexp0|e| < °o for some 0 > 0. 

Results by Bartfai |Q], see (TO . show that the rate in <[T]) is best possible under the finite mo¬ 
ment generating function condition. A multidimensional version of the KMT approximation was 
proved by Einmahl Ifl2ll . from which Zaitsev ( (291 . (30l ) removed a logarithmic factor. For exten¬ 
sions to stationary sequences see the history in JH, where dependent variables of the form X/ ( = 
G(.. . ,£t_i,£jt,£t+i,. ..) for £,, i € Z i.i.d. are considered. Strong embedding results have a truly 
extensive range of applications that includes empirical processes, non-parametric statistics, survival 
analysis, time series, and reliability; for a sampling see the texts (22l (9l , or the articles fTOTl . (28l and 

ED. 

Here we take the approach to the KMT approximation introduced by Chatterjee 0 that has its 
origins in Stein’s method (26l and appears simpler, and is possibly easier to generalize, than the dyadic 
approximation argument of (171 . This alternative approach depends on the use of Stein coefficients, 
also known as Stein kernels, that first appeared in the work of Cacoullos and Papathanasiou 0. In 
some sense, a Stein coefficient T for a mean zero random variable W neatly encodes all information 
regarding the closeness of W to the mean zero normal variable Z having variance a 2 . Theorem 12.31 
below, from (6), demonstrates that a coupling of W and Z exists whose quality can be evaluated 
uniquely as a function of T and a 2 . Theorem 1 1.21 that demonstrates Theorem 1 1.1 1 for the special case 
of simple symmetric random walk, was proved in 0 applying this approach. 

Theorem 1.2 (Chatterjee 0 ). SE holds for £ a symmetric random variable with support {—1,+1}. 

In this work, using the methods of 0 , we generalize Theorem [L2] as follows. 

Theorem 1.3. SE holds for £, any random variable with mean zero and variance 1 satisfying E£ 3 = 0, 
taking values in a finite set A not containing 0. 

To prove our result we first provide a construction in the case where we have a finite number of 
variables and then extend to derive strong approximation for an infinite sequence. Such extensions 
have been studied in the context of the KMT theorem for summands with finite p- th moment in (l9ll 
and also in 0. 

For the finite case we employ induction, as in 0. The induction step requires extending Theorem 
1.4 of 0 from the special case where £ is a symmetric variable taking values in {—1,1}. The 
generalization depends on the ‘zero-bias’ smoothing method introduced in Lemma l2~4l which may 
be of independent interest as regards the construction of Stein coefficients. Theorem ll.4l here is a new 
result for the embedding of exchangeable random variables and a Brownian bridge. 

Theorem 1.4. For any positive integer n, let £\. £ 1 . ■ ■ ■ ■ £„ be exchangeable random variables taking 
values in a finite set A C 1ft 1 .. Let 
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Then there exists a positive universal constant C, and for all V > 0 positive constants K\.K^_ and Ao 

depending only on A and V, such that for all n > 1 and T] > V, a version of W<) ,W\_ _ W n and a 

standard Brownian bridge (B t ) 0<t<l exist on the same probability space and satisfy 

Eexp(A max |W* — y/nr\B k /„\) 

0 <k<n 

/K X 2 S 2 \ 

< exp(Clogn)Eexp f —-- + KA 1 nff — T ] 2 ) 2 j for all A < Ao- 

Moreover, if 0 0 A, then there exist positive constants K\ and Ao depending only on A such that 

/ k 2 2 S 2 \ 

Eexp(A max I W k — a/^ 7^,/,,1) < exp(Clog«)Ecxp ( —-- ) for all A < Ao, 

0 <k<n ' \ n ) 

and if in addition £i,..., £„ are Li.d. with zero mean, then there exists a positive A depending only on 
A such that 

P ( max | Wk — \fnyB k / n \ >A 'Clogn+jcJ < 2e for all x > 0. 

\0<k<n ' J 

The constant C is given explicitly in (ITil in the proof of Theorem 13.11 its numerical value is roughly 
8.4. The constants in the second inequality of Theorem [L4]are those that appear in the first inequality, 
specialized to a case where the lower bound v depends only on A. 

Our extension of the Rademacher variable result of (6| requires a number of non-trivial compo¬ 
nents. Example 3 of |6j] demonstrates how to smooth Rademacher variables to obtain Stein coeffi¬ 
cients, and the author states ‘we do not know yet how to use Theorem 1.2 to prove the KMT theorem 
in its full generality, because we do not know how to generalize the smoothing technique of Example 
3.’ We address this point by the zero bias method of Lemma 12.41 that shows how any mean zero, 
finite variance random variable may be smoothed to obtain a Stein coefficient. 

Additionally, dealing with variables restricted to the set {—1,1} avoids another difficulty. In 
particular, the second inequality of Theorem l 1 .4l shows that the ‘natural scaling’ for the approximating 
Brownian bridge process depends on the variance parameter y 2 = /A 1 £' ! =1 £ 2 , which in the case of 
Rademacher variables is always one. In fact, for such variables, the variance parameter remains the 
constant one when restricted and suitably scaled to any subset of variables. In contrast, in general 
when applying induction to piece together a larger path from smaller ones, their respective variance 
parameters may not match. This effect gives rise to the term (y 2 — t ] 2 ) 2 in the exponent of the 
first inequality of Theorem 11.41 which then needs to be controlled in order for the induction to be 
completed. In doing so, one gains results on the comparison of the sample paths of a more general 
classes of exchangeable variables to a Brownian bridge. 

The second claim of Theorem l 1.4l is shown under the assumption 0 f A. This condition becomes 
critical precisely at (l63l> . where we require that the smallest absolute value of the elements of A is 
positive, from which one then obtains a lower bound v on y when invoking Theorem l3.ll This same 
phenomenon occurs in the proof of Lemma |4~T1 on the way to demonstrate Theorem 1 1.3 1 

The remainder of this work is organized as follows. In Section^ we prove two theorems, one for 
coupling sums S„ of i.i.d. random variables, and one for coupling W n of Theorem 11.41 to Gaussians. 
We also prove Lemma 12.41 which shows how to construct Stein type coefficients using smoothing 
by zero bias variables. Theorems 13. II and 1 1.41 the first result a conditional version of the second, are 
proved in Section[3j and we prove Lemma |4~T1 implying Theorem 1 1.31 in Section 0] 
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2 Bounds for couplings to Gaussian variables 

In this section we prove Theorems 12. II and 12.21 generalizations of Theorems 3.1 and 3.2 of O, and 
our zero bias smoothing result, Lemma 12.41 The first theorem gives bounds on couplings of sums 
S n of i.i.d. variables, and the second on coupling of certain exchangeable sums to Gaussian random 
variables. 

Theorem 2.1. For every mean zero, variance one bounded random variable £ satisfying E(e 3 ) = 0 
and E(s 4 ) < °o, there exists Q\ > 0 such that for every positive integer n it is possible to construct a 
version of the sum S n = £" =1 £, ofn independent copies of £, and Z n ~ A r (0. n), on a joint probability 
space such that 


Eexp(@! \S„ — Z„|) < 8. 


For convenience, we adopt the convention that a normal random variable with mean p and zero 
variance is identically equal to p. 

Theorem 2.2. For n > 1, let £i, £ 2 ,... £ n be arbitrary elements of a finite set A C M, not necessarily 
distinct. Let y 2 = n 1 Yli=\ ^ et ft be a uniform random permutation of {1,2,... ,n}, and for each 
1 <k <n let 


k 

S k = Z £ x(i) and W k = S k 


kS n 


( 2 ) 


n 


Then for all V > 0 there exist positive constants c \,cj and O 2 depending only on A and V such that 
for any integer n > 1, an integer k such that \ 2 k — n\ < 1, and any rj > v, it is possible to construct a 
version ofWk and a Gaussian random variable with mean 0 and variance k(n —k)/n on the same 
probability space such that for all 6 < 62 , 



We now define Stein coefficients, the key ingredient upon which our approach depends. Let W be 
a random variable with E[W] =0 and finite second moment. We say the random variable T defined 
on the same probability space is a Stein coefficient for W if 


E[W/(W)]=E[r/'(W)] 


(3) 


for all Lipschitz functions / and f any a.e. derivative of /, whenever these expectations exist. 


Theorem 2.3 (Chatterjee (6)). Let W be mean zero with finite second moment and suppose that T is 
a Stein coefficient for W with \T\ almost surely bounded by a constant. Then, given any O 2 > 0, we 
can construct a version ofW and Z ~ A"(0. a 2 ) on the same probability space such that 



for all 9 G M. 
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To prove Theorems [2j] and [221 we require the following definitions. Following Section 3.2 of 
m, see also Proposition 4.2 of Q, for X a random variable with finite, non-zero second moment, 
we say X D has the X-square bias distribution when 

E[/(xD)] = (4) 

for all functions / for which the expectation on the right hand side exists. For a mean zero random 
variable X with finite, non-zero variance a 2 , we say that X* has the X-zero bias distribution when 

a 2 E[/'(X*)]=E[X/(X)] (5) 


for all Lipschitz functions / and any a.e. derivative /', whenever these expectations exist. That X* 
exists for such random variables, see IH and Q. 

If X is a mean zero random variable with finite, non-zero variance a 2 , then for any g €= C c , the 
collection of continuous functions with compact support, letting f(x) = / 0 ' g{u)du, using ©, we have 


a 2 Eg(UX L 


= <j 2 ~Ef'(UX 

-I 




= <7 2 E [ f'(uX a )du 
J o 

Ax a y 


= a E 


= E 


X 


x n 

2 m 


X 

= E [X/(X)] 


where X D and U are independent, U ~ U[0,1] and X n has the X-square bias distribution. Thus, using 
©, we have 


a 2 E g(UX°) = E [X/(X)] = <7 2 E[/'(X*)] = a 2 E[g(X*)]. 

Since the expectation of g(X*) and g(UX l ] ) agree for any g G C c , with = r j denoting distributional 
equivalence, we obtain 


X* = d ux u . 

Smoothing X by adding an independent random variable Y having the X-zero bias distribution, 
we obtain the following result which will be used for constructing Stein coefficients for sums. 


Lemma 2.4. IfX is a mean zero random variable with finite non-zero variance, and Y is an indepen¬ 
dent variable with the X-zero bias distribution, then 

E[x/(x+y)] =e[(x 2 -xf)/(x+f)] 

for all Lipschitz functions f and a.e. derivative f for which these expectations exist. 
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Proof. Let V be distributed as X, let U be a U[0,1] random variable, and set 

Y = UV U 

where V, U,V n and X are independent. Note that for any bivariate function g for which the expecta¬ 
tions below exist, by © we have 


E[g(X,V D )] = - 7 E[L 2 g(X,l/)], 


( 6 ) 


where a 2 is the variance of X. Hence 


E [{X 2 -XY)f'(X + Y)] 

=E[(X 2 - XUV u )f'(X + UV U )] 


=E 


=E 


=E 


f l (X 2 -XuV D )f(X + uV n )du 
Jo 


(X 2 —XuV u )f(X + uV u ) 
V 5 


+ XF L 


□ r'f(x + uV n ) 




(X 2 -XV u )f(X + v n ) -X 2 /(X) 
X 5 


V D 

+E[x/(x+y)] 


clu 


=^E[F(X 2 - XV)f(X + V) - VX 2 f{X)} + E[X/(X + X)] 

~E[VX(X-V)f(X + V)] +E[X/(X + 7)], 

o L 

where we have used © in the second to last equality, as well as the independence of V and X, and 
that EL = 0, in the last. Hence, to prove the claim, it suffices to show that the first term above is zero. 
Since X =^V and V and X are independent and exchangeable, we have 


vx(x-v)/(x + v) = d vx(v-x)f(x+v) = -vx(x-v)f{x + v), 

demonstrating that the expectation of the expression above is zero. 


□ 


For any mean zero X with finite, non-zero variance a 2 the distribution of X* is absolutely contin¬ 
uous with density function 

px . ix)= mifixi. a. 


One finds directly from © that 


a < X < b for some constants a < b implies a < X * < b. (8) 

Comparing © with ©, we see that T is a Stein coefficient for X if <7 -2 Z?[!T|X] is the Radon 
Nikodym derivative of the probability measure fj.* of X* with respect to the measure /./ of X. 
Hence, in light of (©. if X is a random variable with mean zero and finite variance, having density 
function px(x) whose support is an interval, then setting 

h x (x) = > 0) we have E[X/(X)] = E [h x (X)f'(X)] (9) 

Px{x) 
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for all Lipschitz function / and a.e. derivative f for which these expectations exist, that is, hx (X) is a 
Stein coefficient for X. We note the first equality in © shows, by virtue of E(X ) = 0, that lix (x) > 0. 

Now consider a random variable X having vanishing first and third moment, variance strictly 
between zero and infinity and satisfying E(X 4 ) < oo. Then the distribution for a random variable Y 
having the X-zero bias distribution exists, and from © with g(x) = x 2 and g(x) =i 3 , we respectively 
find 


E(y)=0 and E(T 2 ) < oo. (10) 

Moreover from © we see that Y has density function p Y (v) whose support is a closed interval. Hence 
the function h Y (y), given by the first equality in ©, satisfies the second. 

Lemma 2.5. Let £],... £„ be independent and identically distributed as £, a random variable with 
mean zero, finite nonzero variance, and satisfying E(£ 3 ) = 0 and E(£ 4 ) < oo, and let Y have the 
E-zero bias distribution and be independent of £i,... ,£„. Then for all Lipschitz functions f and a.e. 
derivative f, 

E[S n f(S n )]=E[Tf'(S n )] 


where 


S n — S„ + Y with S n — £i + £2 + • • • + E n , 


and 


n 

r = £ £f - S n Y + hy (Y) with hy (y) 

i= 1 


E[Fl(y > y)] 
pviy) 


Hpriy) > 0). 


Proof. With si° = S n — £,-, we have 

E[5„/(5„)] =E[ 1 S„/(S r „) + y/(S r J! )] = j^Eteftei + Y + sPy+EFfiY + Sn)]. (11) 

i= 1 

For the first term of (fTTT) . using that the summands £,- are independent and applying Lemma [2~fl 
yields 


E[£,-/(£,- + y+si 0 )] = E[(£, 2 - £ ( y)/(£;+y+si 0 )] = e [( £ 2 - £ i -y)/'(s„)]. 

Now turning to the second term of (fill) , we first note that by © the assumption that the third moment 
of £ is zero implies E(Y) = 0. Now using the independence of Y and S n . © yields 

E[y/(y + S„)] =E[h Y (Y)f(Y + S n )\ =E[h Y (Y)f(S n )\. 

Substitution into (fill) now yields the claim. □ 

Hoeffding’s lemma, e.g. see the proof of Lemma 2.2 of 0 ], will be used below. It states that if X 
is a mean zero random variable that satisfies a < X <b almost surely, then 

E[exp(0X)] < e (*- fl ) 202 /8 for a p Q e M. (12) 


7 



We also require the ‘non central X\ moment generating function identity, 

, ,, „ “pfjirasj) 

Eexp (aV 2 + /3 V) = (1 /‘ g)1/ / 

valid for the standard Gaussian variable V, and all /3 € M and a < 1/2. 
For the law C(X) of any random variable X let 

£(C(X)) = inf {b-a:P(a<X<b) = 1}, 


( 13 ) 


the length of the support of X. For notational simplicity we will write £(X), or i when X is clear from 
context, for £(C(X)). We use that £(X) is translation invariant in the sense that £(X) = £(X — c) for 
any real number c without further mention. 


Lemma 2.6. For every almost surely bounded random variable X, there exists a constant dp(x) € 
(0,°°) depending only on £(X) such that when X\,X 2 ,... are independent random variables dis¬ 
tributed as X, the sum S n =X\ -\ - \-X n and /./ = EX satisfy 


E 


exp 6 


c2 
n 2 ^n 


( 4 


< - exp ( -n6 2 p 2 1 for all n > 1 and \ 




The constant 4/3 is somewhat arbitrary as any value greater than 1 
Theorem 13. 1 I requires a value strictly less than 3/2. 


0 | < ®nxy 

can be achieved; the proof of 


Proof. Let V be a A r (0-1) random variable independent of X. Using Floeffding’s le mm a (fT2l) condi¬ 
tional on V, for any function of V we have 


E[exp(f(V)(X —/i))|V] </ f(y ) 2 / 8 . 


Applying E(exp 6V) = exp(0 2 /2), for £6 < sfl and V independent of Xi,X 2 ,..., letting t(V) = 

fyfofy, we obtain 

fit 


E 


c2\ 1 


exp [ e 2 ^ 
n 


= E 

< E 


= E 


exp ( VlO—^zV 
\Jn 


E ( exp(t(F)X) 
(2£ 2 6 2 V 2 


V 


= E 

= E 


V 


E ( exp ( \p2Q 


V 


exp 


V 8 n 


hs/2 0p^= 

sjn 


E( exp (t(V)(X-fi) + t(V)n) 

(Pe 2 


V 


= E 


exp 


1 


^i-Pe 2 /2 


exp 


n0 2 p 2 \ 

\-i 2 e 2 /2 ) 


\ 4 

< 


V 2 + V2 QpfyTiV 

1 


\-i 2 e 2 /2 


exp 


n0 2 p 2 \ 

\-i 2 e 2 /2j 


where we have applied (fl3l) in the last line. It is now direct to verify that the property required by the 
lemma holds by letting diyx) = 1/ (s/2£(X)), the unique positive solution to 


1 _ 4 

1 -£{xfe 2 /2 - 3 


□ 































Lemma 2.7. Let £ be a bounded, mean zero, variance <3 2 € (0,°°) random variable satisfying E£ 3 = 
0. Then the Stein coefficient hy{y), given by ©/or Y with the E-zero bias distribution, is bounded. 

Proof As £ is a mean zero random variable with finite, nonzero variance, the zero bias distribution 
C(Y) exists. As E£ 3 = 0 and £ is bounded and non-trivial, as in (flOl) one verifies that EF = 0 and that 
Var(F) is positive and finite. Hence, as noted below ©, the Stein coefficient h Y {y) as given by © is 
nonnegative, so we need only show that it is bounded above. 

From ©, an a.e. density of Y is given by 

Py 0) = —T [ udF e (u ) (14) 

<J Z Jy 

where we use Fx to denote the distribution function of the random variable X. From (fl4l) we may 
observe that the support of Y is the smallest closed interval of R containing the support of £. Since £ 
is bounded and has mean zero, using 0, this interval is of the form a.b] for — °o <a<0<b<°°, 
hence for t E [a,b\ the upper limit of the integral in (fl4l) may be replaced by b. 

In particular', for all t e [0,Z>], by © we have 

MO = S ' yPY ff y 

Py if) 

_ fl’ySy udF e (u)dy 
a 2 p Y {t) 

_ f Jt< y <u<by udF r( u ) d y 

a 2 p Y {t) 

_ It b urydydF e (u) 
a 2 p Y {t) 

f t b u(u 2 — t 2 )dF e (u) 

2 a 2 p Y (t) 

b 2 J t b udF e (u) b 2 

— 2 a 2 p Y {t) 2 ’ 

where we have used Fubini’s theorem in the fourth equality, and (fl4l) in the second and sixth. As 
h- Y (t) = h Y (—t) we obtain that h Y {y) is bounded for t £ [a, 0]. □ 

Proof of Theorem 12.11 For short we write S = £i + £o H-b e„ and S = S + Y with Y is as in Lemma 

12.51 As the third moment of £ is zero and its fourth moment is finite, as in (ITOl) . Y has mean zero with 
finite variance, and hence so does S. 

n ~ 

By Lemma 1231 T = £ £? — SY + h Y (Y) is a Stein coefficient for S. Since £ is bounded and the 

i— 1 

third moment of £ is zero, Lemma 12.71 yields that h Y (Y ) is bounded. Also £ bounded implies S is 
bounded. In addition, as e is bounded there exists some B such that |e| <B, and © implies F <B. 
Thus, we conclude that |T| is bounded. 

Now invoking Theorem 1231 there exists a version of S and Z ~ A r (0. a 2 ) on the same probability 
space such that 

Eexp(0|S r —Z|) <2E(exp(20 2 a' 2 (r-a 2 ) 2 )) for all 6 e R. 
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Using |T| <B we have |S — S| < B. It follows that, 

Eexp(0|S-Z|) <2E (exp(fl|0| +26 2 c~ 2 {T - a 2 ) 2 )) . 
Letting Co > B be such that |/jy(L)| < Co, and setting a 2 = n, we obtain 


,-2 


(T - a 2 ) 2 3r + 3 C^S- + 3Cq 


r2 


where S = £ (e 2 — 1). 

/=! 


Hence, 


Eexp(0|S —Z|) < 2exp ( 5|0| 


+ ■ 


6Cn0 2 


< exp 5|0| + ■ 


6C,, 0 


<2q2 


^ Eexp ^60 


2 s 2 +c 2 s 2 ' 


E 


120 2 S~ 


exp 


+ exp 


( 120 2 CnS' 2 


V n 


(15) 


where we applied the simple inequality exp(.r + v) < (e 2x + e 2 y )/l. 


Noting for S and S that £ 2 — 1 and e respectively are bounded and have mean zero, using Lemma 
12.61 for the first two inequalities below, we see that there exists Q\ > 0 such that for all |0| < 0] and 
all positive integers n 


Eexp(120 2 5 ,2 /n) < 2 and Eexp(120 2 Co5 ,2 /n) < 2 and exp(fi|0| + 
Theorem 12. 1 1 now follows from (fl5l) . 


6C 2 0 2 


< 2 . 


□ 


We now prepare for the proof of Theorem I2.2l bv providing a few lemmas. For A the finite set in 
which the basic variable e takes values, let 


T> = {b — a : a,b € A} and T )+ =Dn[0,°° 


(16) 


the set of differences of the elements in A, and those differences that are non-negative. We note here 
that V is symmetric in that V = —V. Let also 


B = max \a\ 

aeA 


(17) 


Recall the definition <[2]) of Wk and observe that we may write 

^ = ft - k Jn = i <*(0 - £ I £*(/) e .(0 - ! I e n{i) 


i=i 


i=l 


1=1 


n i=k+i 


k n 


I ( e ^(0 £ n(j))> 


n f 


i=l j=k+\ 


and therefore 


k n 


W k— 52 W k,d where Wk,d — n ^ ( £ ^(0 £ ^U)^(\e^i)-e n u)\= d )- 


(19) 


del 5+ 


;=l j=k+] 
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Lemma 2.8. Under the hypotheses ofTheorem \2.2\ for any d G R, 1 < k < n and d G D 1 we have 
Eexp(OWk,d/Vk) < exp(d 2 0 2 /2) and 'Eexp(OWkfVk) < exp(fi 2 0 2 ), (20) 

where B is as in G3>. Further, there exists (Xq > 0 depending only on A such that 

E[exp {otW^ d /k)\ < 2 for all |a| < ao and all d G V + . (21) 


Proof. We may assume d > 0 as the result is otherwise trivial. Fix an integer k in [1 ,n\ and d G T> 1 , 
and let m(6) := Eexp (6Wk,d/Vk). We argue as in & Since Wkj is bounded, the function m[6) 
is differentiable and differentiation and expectation may be interchanged. Hence, using (fl9l) for the 
second equality, 

m\e) = -^=E(W M exp(0W M /v4)) 

j k n 

= 7f Y Y £ K{j))^{\£^-e % ( j )\=d) ex V{QWk,d/'/-k)\- ( 22 ) 

n v«: ,- = i ;=r-+i 

Now, let i and j satisfying 1 < i < k < j <n be arbitrary and let n' = no (/./) where (i.j) is the 
transposition of i and j. Then (n. n') is an exchangeable pair of random permutations. Let W' kd be 
defined as in (fl9l ) with n' replacing n. Using exchangeability for the first equality and the definition 
of n! for the second, 


^[( £ 7r(/) — £n{j))^-(\ex(i)-Ex(j)\=d) ex P( 6Wk,d / V^)] 

=E[(£^/(,-) — exp (6Wk'd/y/k)\ 

=E[(£^(y) — = d) exp {0W k d/Vk)\ 

= — E[(£^(,') — £ 7c(j))^-(\e K ^-e^\=d) exp (dW k d/Vk)\. 
Averaging the first and last expressions yields 

®[( e rc(0 ~ £ 7r(;'))-®-(|e I r(,)-ei(;)|=<A) ex p(0Wyt,<;/>/&)] 

= ^ E [( e *(0 - £ K{j))^{\e^-e K(j) \=d)^^ W k4/^) -exp^W^/v/fc))]. 

Note 


I Wk4~W' k4 

Y ( e w(0 — e w(o) 1 (|e^(i)-e < ,)|=d) + Y ( £ ro(0 - e w(y)) 1 (|e^(()-e^)l= d ) 

k+\<l<n,l^j 1 <l<k,l^i 

+ { £ n(i) ~ £ 7 i(j))^-{\e K{i] -e nU) \=d) ~ ( Y (%(0 _ %(/))l(|e Jt ,( i) -£^( i) |=d) 

k+l<l<n,l 7^ j 

+ Y ( £ n'(l) ~ e Tt'{j))^-{\e n , m -e n , u) \=d) + (%(/) — %(i))l(|e I / (i) -e I / a) |=rf)J 
\<l<k,Ui 


Y( £ n(i) £ Kd)^tK{i)-£n(,)\=d) + Y( £ xd) £ xU))^(\en(l)-en(j)\=d) 


1= 1 


;= i 


< — [nd + nd\ = 2d. 
n 


(23) 
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Now applying the inequality \e x — e y \ < ^\x — y\(e x + e y ) we see that (1231) in absolute value is 
bounded by 


101 


4 y/k 


lE[|e 7r(j . ) - |l(| e3r(0 - ejrO) |= rf) I|((exp(0444^/Vfc) + exp(0W^/V^))] 


< 


101 


Asfk 

\0\d 2 

Vk 


2d 2 E[exp(6Wk,d / Vk) + exp {dW[ d / Vk)] 

m(0). 


So, from (f22l) . and the fact that 1 <i<k and k < j < n arc arbitrary, we obtain 

\m'{e)\< 1 ^ d Yt L m i e ) <d 2 \e\m(0). 

nVk Vk i= i j=k+l 


Now, using, m( 0) = 1, and that m(6) >0 for all 0 G M, for 0 > 0, we obtain 


/o 


m'{u) 
m(u ) 


du < 


j 

d udu 


m{6) < exp(d 2 0 2 /2) 


and for 0 < 0, we obtain 

r0 —m!(u) 


[ U du < [ d 2 (—u)du => m{6) < exp(d 2 0 2 /2), 
Je m(u) Je 


proving the first inequality of (l20l) . 

Arguing similarly, now letting m(0) := Eexp {0W k /Vk) and W k as in ([2]) with n' replacing n, 
noting |Wk — W' k \ = \e K {j) — e^j) \ < 2B, we obtain 


E[(e*(0 - £jtU)) ex P(0W k /Vk)] < E[(e s(i) - £ n{j) ) 2 {{exy{dW k / Vk) + exp{6W[/Vk))\ 

10 \AB 2 . . 2101 B 2 .. 

< -- —2m{9) = —m(0), 


4y/k 


Vk 


so that \m'(6)\ < 2B 2 \Q\m(9), implying the final inequality of (l20l) . 

Turning to (l2Tb . letting Z be a standard normal random variable independent of W k j, by (l20l) and 
(fT3l) . for all d € V + and a < 1/ {2d 2 ), we have 


Eexp(aW kd /k) = Eexp (^VZaZW^d / Vk) < Eexp(d 2 aZ 2 ) < -j=—= 


Now set Oo so that the bound above is any number no greater than 2 when d is replaced by max {d : 

d£V+}- □ 


Lemma 2.9. Under the assumptions of Theorem 12.21 there exists (X\ > 0 depending only on A such 
that for all n, all 1 < k < 2«/3, and all 0 < OC < tt\, 

Eexp {aSl/k) < exp 
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Proof. The steps are the same as in the proof of Lemma 3.5 of Q. For Z a standard normal random 
variable independent of it, by definition (0) of W k we have 


Eexp (aS\/k) = Eexp —S^Z 



( 2a 2a kS n 

= Eexp J—W k Z+J- --Z 

\ V k V k n 


By (l20l) . with B given by (fT71) . for the first term we obtain the bound 


E 


exp ^ \[2aZW k /Vk 


< exp(2 aB 2 Z 2 ). 


Thus, 


Eexp(aSf/&) < Eexp 


^2aB 2 Z 2 + 


12a kS n 
V k n 



Recalling S n is nonrandom, using the non central xj identity (fl3T) . we find that 

Eexp(aSlfk) < 1 = exp (——^ for 0 < a < 1/(4B 2 ). 

Vl-4a B 2 F V(l-4 aB 2 )n 2 J M ' 

The proof of the lemma is now completed by bounding k by 2n/3 and choosing a\ > 0 small enough 
so that 1/(1 — Aa\B 2 ) is sufficiently close to 1. □ 

Proof of Theorem l22l - We assume 6 > 0. Applying our convention that zero variance normal random 
variables are equal to their mean almost surely, when n = 1 we have So = Wo = W\ = Zo = Z\ = 0 
and the result holds trivially, so we assume n > 2. Recalling definition (fl6l) of V 1 for each d > 0 in 
V + and that V is symmetric, let Y r j have the uniform \J[—d/2,d/2] distribution, and be independent 
of each other and of the uniform random permutation n, and for d = 0 let To = 0- Set 


^ = I Yd- 

deV+ 

For arbitrary i,j satisfying 1 <i<k< j < n let Fij = a{ 71 (d) : 1 f {/,,/}}. Regarding the collec¬ 
tion {£1 ,...,£„} as a multiset, we have 

) > £ x(j )} — {&id 1,.. . ,/l} \ {£jrp); l 0 j }}; 

showing that {e n ^, }, and therefore also £ 7lil] + £ Kijj and d,j := |£ K{l) — £ n (j) I are measurable with 

respect to F\j. Further, the conditional distribution of 

v _ £ x(<) ~~ £ K(j) 

ij ' 2 

given Fjj is uniform over the set {—dij/2,dij/2}. 
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Let = S k - £n(i),W^ ,] = - ( k/n)S n and Y^ lj) = Y- Y dij . For e w(i ) / £ n y), applying Lemma 

12.41 and the easily verified fact that the zero bias distribution of the variable that takes the values 
{—a,a} with equal probability is uniform over [—a,a], for some fixed Lipschitz function /, we have 

E [(^ (0 -£^))/(W, + F)|^ 7 ] 

=2E [XtjfiXij + Y dij + W f + (e* (0 + £ kU] )/2 + F :// ')'./// 

=2E[(^- - XijY^f'iXij + Y^+W® + (e* {0 + £ n{j) )/2 + Y™){Fy} 

=2E [(X?j -XijY dij )f(W k + Y)\Fij} 

=2E[(4-/4 - XijY dij )f\W k + Y)\Tij\. 


sides being zero. Taking expectation we obtain 

E[(e*( 0 - e n{j) )f(W k + y)] = E [tijf(W k + Y)] 

where 


'v = 2 hr-W«d = 


o2 _i_ 

8 K(i)^ 8 7lU) 


It is easy to verify using ©, or by integration by parts, that for U ~ U[ -a.a]. 


also when £ K y 

) = £<,> both 


(24) 

~ e n(j)) Y dij■ 

(25) 

U[— a, a]. 



num\=-n(a 2 -u 2 )f(u)], 


implying 


E [Yf(W k + Y)]= Y E[Y d f(Y d +W k + {Y-Y d )\ 
dev+ 

1 


= ^E E 


' deT>+ 


-j-Y 2 )f'(Y d + W k + (Y-Y d ) 


= E[R 4 f'(W k + Y)}, (26) 


where 


R 4 = 


? \ 4 

^ dev+ x H 


■~YJ 


Since T> + is finite there exists Co > 0 so that 


|^4|<C 0 . 


(27) 


From m . 


so lettting 


W k 


1 

n 


k n 


Y Y 

i— 1 j—k -\-1 


£ Tt(j) )) 


W k = W k + Y 


(28) 
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and combining (l24l) and (l26l) . we have 


E[W k f(W k )\=E[Tf'(W k )\, (29) 

where the Stein coefficient T, in light of (1251) . is given by 

j I: n 

T = — ^ XL t\j + /?4 = R\ — /?2 — R3 +R41 

11 i=\j=k+\ 


where 


R 




Ri 


| i n 

~Yj £ k{i) Y £ x(j)’ 

n i= 1 j=k+1 


and 


R 3 - XL ( e ^(0 e x(j)) Yd ij 

1 \<i<k< j<n 

= Y Yd ~ XL ( £ «(o 

d€V+ 1 <i<k<j<n 

with Wk.d as in ( fl9l ). Since \Y c i\ < d/2, we have 


1(I^(0 e ?r(./')| — — XI Y dWk,di 

dev+ 


l* 3 |< £ d^l- 

J€X>+ Z 

Recalling that v > 0 is a given fixed number, and that 

7 1 ^ T ~7 k(n — k)y 2 2 k(n — k)tf 

y~ = - V £f, set a- =- and a- =--—, 

«r-j » n 

for a positive constant rj > v, noting that since n > 2 both a 2 and a 2 are positive. Then, 
(T-a 2 ) 2 _ n 


k(n — k)r\ 2 


[R 1 — C“ — R2 — R3 + R4) 


(30) 


< 


&(« — k)v 2 


(Ri — a 2 —R 2 —R 3 +R 4) 2 ■ (31) 


15 







To bound this quantity, consider first 


R\ — o~ 


= 2 ~ ((«-*) E 4)+* E e lu) )- 


=^[( n - k )L £ ki) +k E £2 


i=t 

k 


j=k+\ 


1 

2 n 


i=l 

k 


j=k+\ 


n(j) 


k(n — k)JL 2 


1=1 


nr 1 , A 2 2 k(n -k) -A 2 

'-*)E e *(0+* £ e xU) ---£ e n{i) 


r(‘V K fc 7r(;) 
1=1 j=k +1 

/ k r. n 

,2 K V „2 


i=l 


= ^(( n -*)|E e *(o-^I e ;(o]+*l I 


" 2 n-k". 


2 n 

1 

272 


, 1=1 


J=*+l 


£ xU) 


j =i 


(72 — &) 


n — k 


k " 


Mj) 


n — k 


L e *(o--1 e *(o +*k I 4-)- — 


7=1 


71 /=*+! 


' ;'=*+! 


.7=1 


w(/) 


(((”-*) 2 -*(«-*))L e *(«-)-(( n -*)*-^) E 


2 n 2 

72 — 2 k 
2t2 2 


t(0 _ -7CJ7C-/C ; z^U) 

7=1 7=*:+l , 


■- k )L4®- k I *4) 

1=1 ;=*+l / 


Hence, for all £ = 1,2,..., 72 , with B as in (fTTT) . 


|/?i -<y-|; 


< 


172 — 2fc| 

2 n 2 

172 — 2fc| 


(n-*)£ei 0 + fc £ e 2 (y) +|a 2 -a 


1=1 


i=k+l 


7 2 + |a 2 -a 2 |< tM B 2 + |5 2 -a 2 | 


Choosing k such that |?2 — 2k\ < 1, we obtain 


|Hi -v 2 \<-y + \<y 2 -<y 2 \ 


Regarding R 2 , for any k G {1,... , 72 } we have 


(32) 


l*2| = 


1 


I £ ,(0 E e ^U) 

7=1 j=k+\ 


<B\S k \. 


Hence, for k such that \2k — n\ < 1, from (f3l~b . (l32l) . (f33l) . (l30l) and 
|^-^(y 2 — t ? 2 )I> we obtain 


(33) 


and that I a 2 — a 2 1 = 


(T-a 2 ) 2 

rr2 - 


k (72 — &)v 2 


R 2 /2 + |a 2 -a 2 |+B|S^|+ £ ^|W M |+C 0 


dev+ 


<C 


( 1 +72(y 2 


c2 U/2 ' 

-t? 2 ) 2 + ^+ £ 

k deV+ k 
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for some constant C depending uniquely on A and v. 

We now verify that the hypotheses of Theorem [23] hold for Wk of (I28T) and T. Clearly Wk satisfies 
E(W*) = 0 and E (W 2 ) < oo. By (l29l) . T is a Stein coefficient for W k , and T is easily verified to be 
bounded. Writing Z for short for r\Z k in the statement of Theorem 12.21 we note that Z is distributed 
jV"(0, tr~), and by Theorem l2.3l we can construct a version of Wk and Z on the same probability space 
so that for all 0, 


Eexp(0|W4 — Z|) < 2Eexp(20 2 a 2 {T — a 2 ) 2 ) 


< 2Eexp (iCQ 2 ( 1 + n(y 2 — r] 2 ) 2 + y- + £ W * 4 




k ' k 

* dev+ K 


With D = 4L f /eD+d we have \Wk — Wk\ < |T| < £ |T f /| < D. Letting q = |D + | + 1, we have 

d€V+ 


Eexp(0|W*-Z|) 


/ s 2 W, 2 .' 

< 2exp(D|0| + 2C0 2 + 2C 0 2 7i(y 2 — T] 2 ) 2 )Eexp I 2C6 2 -^- + 2C0 2 £ — 




d€V+ 


< -exp(D|0| + 2C0 2 + 2C0 2 n(y 1 — T] 2 ) 2 ) | Eexp (lCqQ 2 ^-\ + ^ Eexp|2C^0 


32 W l d 


deT>+ 


by the convexity of the exponential function. Using Lc n i m as [2. 9 1 and l23l there exists 03 > 0 depend¬ 
ing only on A and v such that for all 0 < 03 , we obtain 


Eexp(0|Vl4 — Z|) < -exp(D|0| + 2C0 2 + 2CQ 2 n{y 2 — T] 2 ) 2 ) ( exp ( l+6Cqd 2 ^) +2(q-l) 
a \ \ 4 n J 


<2exp(D|0|+2C0 2 + 2C0 2 7i(y 2 -T7 2 ) 2 ) (exp ( 1 + 6Cc/0 2 ^ ) +2 


471 

S’ 2 
-,2^n 


Now choose 04 > 0, depending only on A and v, so that 

2 exp(D 04 + 2C0|) < e and note exp (1 + 0 2 .r) + 2 < exp (2 + 0 2 x) for all x > 0, 
implying that for 0 < 0 2 := 03 A 04 , 


Eexp(0|W* — Z|) < exp ( 1 +2C0 2 n(y 2 — t] 2 ) 2 + 2 + 6Cq6 


,2 

4n 


exp f 3 + 6Cq6 2 + 2C6 2 n(y 2 — r/ 2 ) 2 


which is the desired bound. 


□ 


3 The Induction Step 

In this section we present Theorem l3.ll which we use to prove Theorem ll.4l that generalizes Theorem 
1.4 in GD. Let £\, £ 2 ,... £« be arbitrary elements of a finite set A C E. not necessarily distinct, and let 
K be a uniform random permutation of {1,2,... ,77} . For each I < k < n recall 

k kS 

Sk = Y, e *® and W k = S k -—. (34) 

i=t n 
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We show {W\.... .W n ) and a positive multiple of a Gaussian vector (Z\,... ,Z n ) obtained by eval¬ 
uating a Brownian bridge process on [0,«] at integer time points can be coupled on the same space so 
that the moment generating function of their maximum absolute difference achieves the exponential 
bound (1391 below. In place of coupling, the result of the theorem can be equivalently stated in terms 
of the existence of a joint probability function p^(s.z) on (5),... ,5 n ) and (Z\,...,Z n ) having the 
correct marginals whose joint realization obeys the desired bound. 

It will be helpful to regard the collection £ = {£1 ,..., £,,} as a multiset. We say s G R" is a ‘path’ 
corresponding to a multiset of ‘increments’ £ when there exists tz G V n , the set of permutations on 
{1such that s can be achieved by summing the increments e in the order given by 71. that is, 
when s is an element of the set of all feasible paths 

k 

A!' e := {s G R" : Sk = ^= 17T G V n }. (35) 

i=t 

Conversely, the multiset of increments corresponding to a path s is given by 

£ s ={si, — (36) 


so that s G A'l if and only if £ s = £. 

Suppose that among £ are / distinct numbers, appearing with multiplicities mi,... ,mi, necessarily 
summing to n. Then letting fg (s) be the probability mass function of (Sj,... ,S„) as given by (l34l) . 
we have 


W = 


n\ 


wi\ \rri2\ •.. mi ! 


and / e "(s) = ( s e ^e) = 


\A n e 


1 ^ 


|-1(£ S =£), 


(37) 


that is, the distribution fg (s) is uniform over Gig. 

The following result is a conditional version of Theorem 1 1.41 


Theorem 3.1. Let £\, £ 2 ,... £„ be arbitrary elements of a finite set A C R, not necessarily distinct, 71 
a uniform random permutation of { 1,2,....« }, Sk and Wk as in (1341) . and y 2 = n 1 f_f_ | £ 2 . Then there 
exists a positive universal constant C, and for every V > 0 positive constants Aj W 2 and Ao depending 
only on A and V such that for any integer n > I and every t] > V one may construct a version of 
(Wk)o<k<n aR d Gaussian random variables {Zk ) {] . k<n with zero mean and covariance 


CO V(Z,-,Z;) 


{iAj)(n-(iV j)) 
n 


(38) 


on the same probability space such that 
Eexp(A max |W) — 77 Z,|) 

0 <i<n 

< exp ^Clogn + S" + A r 2 A 2 /i(y 2 — q 2 ) 2 ^ foranyf<X o- (39) 

Proof. As the result holds trivially for A < 0 we need consider only A > 0. Also, as Wo = 0 and 
Zo = 0 by convention it suffices to consider the maximum over 1 < i < n in (l39l ). We use Theorem 
12.21 and induction to prove the theorem. 
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Recall the constants cq from Lemma [2~9l depending only on A, and c\ ,c 2 and 0 2 from Theorem 
12.21 depending only on A and v. With B given in ( fT71 ). letting 0 5 be the unique positive solution to 


sj 1 —B 4 0 2 /2 3’ 

depending only on A. We will demonstrate the claim holds with 
2 + lo<*4 

c = - .. ° . , K l =Sc l , K 2 = 18c 2 and Ao = 

log(3/2) 


(40) 


0£i 02 05 

1 A-r A- 


32ci 2 v/72^' 


(41) 


Note that any multiset £ = {gj,..., £„} of elements of A lies in exactly one set of the form 
B n (a, b) = {{£!, e 2 ,...,£n} : £ e,- = a, - £ e ; 2 = ft 2 } 


(=1 


as a and b range over all pairs of feasible values of S„ and y, respectively. Fix one such feasible pair 
a,b, which may be notationally suppressed when clear from context, let £ € B"(a,b ) be arbitrary and 
fix any value rj > 0 . 

With /g (s) the probability mass function of (Si,... ,S n ) given in (l37l) and 0”(z) the probability 
density function of a Gaussian random vector (Z l 5 ...,Z„) with mean zero and covariance (l38l ). we 
show that for each n > 1 , we can construct a joint probability function Pg (s,z) on A n e x R” having the 
desired marginals 


L Pe(s,z) 

se^te 


0"(z) and f Pg(s,z)r/z = /”(s) 
JR" 


and satisfying the exponential bound 


(42) 



exp A max 

' l<(<n 


ia 

Si - TjZi 

n 


< exp 


fdog 


n T 


Pe(s,z) 

KA 2 a 2 


d z 


F/GA 2 tt(fr 



for all A € (0 ,Aq], 


(43) 


for all r] > v, with C. K \. K 2 and Ao as in ( ITfl) . with C universal and the latter three constants depending 
only on A and v. 

We will prove the claim by induction on n. For n = 1 we note that VT| = 0 by ([2]) and Zi = 0 by 
convention, since it has mean zero and covariance given by (1381) . Hence (l39l) holds for n = 1 for all 
C, all nonnegative K\,K 2 , and all Ao, and in particular for the set of constants specified in (|4TT) . 

Given n > 2, suppose that for all / = 1,2,... ,n — 1 and all multisubsets £ of A of size / we can 
construct p£(s,z) satisfying (1421) and (1431) . Take k = [n/ 2], let U denote multiset union and define the 
sets 


S'e' k = {s : ^ for some £ 1,62 such that |£i | = k,£i U £2 = £}, and 
see 1 

B n /{s) = {{e i,£ 2 ): ^ £ = 5 ,|£i| = A,£ 1 U£ 2 =£} for s G Sg'\ 

£G£i 
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That is, S'g 1 is the set of all feasible values at time k of a path having increments £, and B'^ k (s) is the 
set of all ways of dividing the n increments £ into sets of sizes k and n — k so that the path at time 
k takes the value 5 . Counting the number of paths that take the value 5 G Sg ,k at time k shows that 
g'e' k (s), the marginal density of S k in f"( s), is given by 





I^Ci 


11 ^ 7*1 


1^1 


(44) 


Similarly, let h n,k (z ) denote the marginal density function of Z k in d''(z), that of the Gaussian 
distribution with mean zero and variance k(n — k)/n. By Theorem 12.21 there exists a joint density 
function y/ e uk (s,z) on Sg ,k x R and positive constants C\,C 2 and 62 , depending only on A and v, such 
that 

\l£’ k (s,z)dz=ge k (s), Y, Ve’ k (s,z) = h n ’ k (z), (45) 

S€S n / 


and for 9 < O 2 and 77 > v, 



v4 a 0 ,z) 


dz < exp 


| c\6 2 cr 


C29 2 n(b 2 



(46) 


For sG Sg' k ,z G R, and recalling the definition (l36l) of £ s , s 1 .s 2 such that (£ s *,£ s2 ) G Bg' k (s), 
z 1 € W k and z 2 G W'~ k , let 


where 


Y^s,z,s\z\s 2 ,z 2 ) = \i^ ,k (s,z)P e , s {e sl ,e s 2 )p k sl (s 1 ,z 1 )p" 2 *(s 2 ,z 2 ) 


(47) 


^(£ 1 , 62 ) = 






11^ 


n—k\ 


!((£!,£ 2 ) € B'e k (s)). 


Interpreting (l47l) in terms of a construction, one first samples the joint values .v and z of the coupled 
random walk and Gaussian path at time k, then chooses increments corresponding to s 1 and s 2 , the 
first and last half of the walk according to their likelihood over the choices of those whose increments 
over the first half of the walk sum to s, and whose union of increments over both halves must be £, 
and then samples coupled values of the paths with discrete Brownian bridges before and after time k. 

One may verify that yg is a density function by integrating over z 1 and z 2 using the second equality 
in d42b followed by applying the second equality in (l37l) . integrating over z, and then summing over 
all s 1 and s 2 and 5 , this last operation being equivalent to summing over all paths s with increments £, 
see (l50l) and the explanation following. 

Now, let (S,Z, S^Z^S^Z 2 ) be a random vector with density 7 ^' where S 1 = {Sj) l<i<k , S 2 = 
( s J)i<i<n-k and Z * = ( Z /)t <i<k’ 7 = ( z l)\<i< n ~k- Let S be obtained by ‘piecing’ the paths S 1 and 
S 2 together at time k according to the rule 


f S] 1 <i<k 
\ S + Sj_ k k <i<n, 


(48) 
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here noting Sk = S, and define Z by 


f Z/ + |Z 1 <i<k 
\ Z lk + ^k Z k <i <n, 


(49) 


here noting likewise that Z/t = Z, since Z} ( = 0. Now as in (6), we demonstrate that p£ (s,z), the joint 
density of (S,Z), achieves the desired marginals (l42l) and exponential bound (l43l ). 

1. Marginal distribution of S. Let s be the path constructed from .v.s 1 and s 2 as S is constructed from 
.S’.S' and S 2 in 08]). Note that 

{ S :se4”}={ S :( £ s y)e^)}, 


and that Sk = S almost surely. Hence, if S 0 A n e then from (l47l ) S has probability zero. For the marginal 
of Yg to be non-zero on .v.s 1 . s 2 , first s must be a feasible value at time lc for a path with increments 
£, then s 1 must be a path of increments that attains the value s at time k, and finally the collection of 
increments determined by s 1 and s 2 must match the given set £ of increments. In this case we obtain 
from (l42l) . (1451) and (l44l) . that the marginal distribution of (.S’. S 1 . S 2 ) is given by 


J Yg (s,z,s l ,s 2 ,z 2 )dz 2 dz l dz 

= (s'tCrV) 


•k \ sn—k(^l 

e s v 7 " e* 




n—k | 

.2 | 


\A n e 


-(Cl .We3?*(*: 


14,11 A 


n—k\ 


l^| 

= fe(s) 



(50) 


Now observing that (1481) gives a one-to-one correspondence between (.S’.S 1 .S 2 ) and S we find that S 
has marginal density /”(s) as in (1371) . 


2. Marginal distribution of Z. Consider A k £{ x A'f k , the set of all pairs of paths (s 1 , s 2 ) with 
increments £i and £2 respectively. Using (1421) and (l45l) . and noting that (£ s ',£ s ") = (£1 ,£2) for 
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(s ,s 2 ) € A El x A El \ the marginal distribution of Z, Z 1 , Z~ is given by 

£ £ £ ?e s 1 , z 2 , s 2 , z 2 ) 

.65;’’* (ei,e 2 )65;’*(.) (sb^Je^x^-* 


= £ v4’ k (s,z) 


s€ S" 


= £ V'e' M 


seS" 


£ P e ,.(Ci,e 2 ) £ p| 1 (s 1 ,z 1 )p"/(s 2 ,z 2 ) 

(ei,e 2 )es;’*(.) (sVleA*. x.4;-* 

£ ^(ei,e 2 ) £ Pe,(s\z l ) £ p£“*(S 2 ,z 2 ) 


.(«iA)€sr(») 


s'e^j 


s 2 e^-* 


= £ v£’*M 


. 65 ; 


£ P e , i (£ 1 ,£ 2 )^(z 1 )f^(z 2 ) 


(e,,e 2 )es;’» 


= ^(z 1 )^(z 2 )£^(,,z) 

= ( p n ~ k (z 2 )(f) k (z l )h n ' k (z ) 

where we have used that £ g £ f g„,* P £v (£ i .£ 2 ) = 1. Hence Z, Z 1 and Z 2 are independent with 

densities h' hk (z), ^(z 1 ) and <j) n ~ k (z 2 ) respectively, implying that Z given by (l49l ) is a multivariate 
mean zero Gaussian random vector. As in (5), one can verify that Z has covariances given by ( l38l ). 
and hence Z rsj 0"(Z). 


3.The exponential bound. For 1 < i < n, letting 


m 

Wi = Si--, 

n 


we show that 


1 <i<n 


K\X 2 a 


Eexp(A max \Wj — 17 Z,-1) < exp Clog/iH-b A 2 A“«(Zr — rj 2 ) 2 for A e (0 ,Aq] 


where C,K\,Ki and Ao are as in (IdTT) . We continue to proceed as in l 6 l. 
Again writing S for S k , let 


Tl := max 

1 <i<k 


s!-§-nz! 


,Tr := max 

k<i<n 


i i — k, 

Sl k -—(a-S)-r]Z- 


n — k 


2 

i~k 


and 


T ■= 


ka 

S -r]Z 

n 


Note that when 1 < i < k we have 

|Wi- — T|Zf| = 
< 


5 , 1 ---t 7 (z/ + 


+ 


iZ 


sj-f-iz? 


iS ia 

i 

k n 

k 


r ?Z 


<r L + -r<r L + r. 

k 
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Similarly for k <i <n one can verify VV, — T]Z/| <Tr + T, proving 


max | Wj — rjZ,-\ < max{r L + T, T R + T}. 

l<i<n 

Now fixing A < Ao, the inequality exp(.r Vy) <e x + e y yields 

exp(A max |W,- — 17 Z ? -1) < exp(Ar L + Ar) + exp(A7A + AT). (51) 

1 <i<n 

To prove that the exponential bound holds, we develop inequalities on the expectation of the two 
quantities on the right hand side of (BTT) . starting with the expression involving 7) . 

Note that £ s determines S, and since £ is fixed £ s is also determined, so by ( |47l ) the conditional 
density of (S 1 . Z 1 ) given (£ s ,Z) is p^ s , (s' .z 1 ). Now using that the moment generating functions of 
T l and T are finite everywhere and that T is a function of {S.Z\, invoking the induction hypothesis 
and applying the Cauchy-Schwarz inequality twice, with y 2 = (I /k) , £ 2 ( . ( we obtain 


Eexp(A?i + AT) = E E (exp(Ar^)|f f ' ,ZJ exp(AT) 

E ^E ^exp(A7i)|£ s ,Z) ) E(exp(2Ar)) 
'2^iA 2 S 2 


< 


1 V 2 


< exp (C log k) 


Eexp 


k + 2 &A 2 k(yf-rj 2 ) 2 jEexp( 2 Ar) 


1/2 


< exp (C log k) 


Eexp 


4K { X 2 S 2 


k ) 


) Eexp (4K 2 k 2 k(tf - q 2 ) 2 ) 


1/4 


(Eexp( 2 Ar)) 1/2 . (52) 


For the first expectation in (l52l) . (l44l) implies that 0 < 4AT 1 A 2 < cq, and as \2k — n\ < 1 we may 
invoke Lemma l29l to yield 


^ ,'4K l l 2 S 2 \ ( 3K { ?i 2 a 2 

Eexp ---) < exp ( 1 4 - 


For the second expectation in (1521) . recalling the definition of y,, 


(< 


(53) 


Eexp (4K 2 A 2 k(y[ - q 2 ) 2 ) = Eexp ^4&A 2 ^£(£ 2 ( . } - rj 2 ) ] ] =Eexp(V%), (54) 
where 6 = 2A s/K^, and we write 


i— 1 


i'=l 


u k = £(4 0 - vr) = £ ( £fhen({k]) -~ r l 2 )= X>«> 


1=1 

2i 


where [k] = { 1 ,... ,k} so that n([k\) = {7 z(i) : i = 1 , 2 ,... ,k}, and a, = £fl i€7t ([k]) — (k/n)r\ 
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To bound (l54l) . we will argue as in Lemma 127)1 Observe that for V a standard normal random 
variable independent of £4, 


Eexp = Eex P (y20-^=£4 


V 


= Eexp(V2 e^^l Uk 


Vk 

= Eexp (^/2d^=U k 


sgn(L) = 1 ) P(sgn(V) = 1) 


+ Eexp(\/2 0-^(-£4) 


sgn(L) = -1 P( sgn(L) = -1). 


Now using the independence of |L| and sgn(V), and that sgn(L) is a symmetric ±1 random variable, 
we obtain 


Eexp (d 2 ^) = - Eexp (\/2QUi)—)=\ +Eexp (y/l 0 (-£/*) 






Vk) 




Vk 


(55) 


Recall that random variables X\ ,X 2 , ■. ■ ,X n are said to be negatively associated, see lfl5l . if for any 
two disjoint index sets I and /, 


E \f(Xi,i € I)g(XjJ G /)] < E \f(X h i G I)]E[g(Xj,j G J )] (56) 

for all coordinatewise nondecreasing functions / : —>• M and g : Rl J l —> R. 

Let X\ ,... ,X n be negatively associated. It is immediate that aX\ + b,.. ., aX n + b are negatively 
associated for all a > 0 and b G R. In addition, letting Y t = -X, for all / = I..... n, for / and g 
coordinatewise nondecreasing functions and I and J disjoint index sets, as —/(—•) is coordinatewise 
nondecreasing, we have 


E \m,i G I)g(Yj,j G 7)] = E [(-f(-X h i G I))(-g(-XjJ G 7))] 

< EG r))]E[(-g(-Xj,j G 7))] = E \f(Y h i G I)]E\g(YjJ G 7)], 


demonstrating that — X \,..., —X n are negatively associated. Combining these two facts, aX\ +b..... aX n + 
b are negatively associated for all a G R and b G R. By a direct inductive argument on (l56l) . 


E 



<n E [/<^)] 

i=i 


(57) 


whenever the functions fi,i = 1,2,... ,n are all nondecreasing. 

By Theorem 2.11 of 11751 . taking the real numbers in Definition 2.10 there to consist of k ones and 
n — k zeros, the indicators 1 lg^^])• ■ • • ■ are negatively associated; hence so arc a t .....a n and 

— ai,..., —a n . Thus, by (l57l) . we have 


E 


exp 


(V2 eui 


\V[ 

Vk 


V 


= E 


<f[iE 

i=i 


exp 


V 


V2 eaM 

\/k 


HE 

i=i 


V 


exp 


\/20 


in 

” (0 v ) Vk 


v 


(58) 
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Now since — q 2 < £ 2 (rj — r \ 2 < B 2 — r\ 2 , using Hoeffding’s le mm a (fl2l) with /J = b 2 — r\ 2 , the 
mean of e 2 (i , -tj 2 , we obtain 


HE 

i= 1 


exp 


f' / 2e(4,0 


~T) 


Vk 


v 


= exp 


< exp 
B 4 6 2 V 2 


B 4 0 2 V 2 


, *^3) 


■ + 


V20/iVJfc|V|J 


< exp ( — - (- V26lJ.VkV ) +exp ( -f- V26fjVk(—V) 


Using that V and —V have the same distribution, taking expectation in (1581) and then applying the 
non-central chi square identity (fl3l) yields 


E 


exp (^V 2 dUk^= 


<2E 


B 4 6 2 V 2 r r \ 
exp [ ---h V2 6\iVkV I 


4 

2 


: exp 




< A exp I - k0 2 fi 2 ) (59) 


y/l —B 4 0 2 /2 *\V1-B 4 6 2 /2) - 3^ f V3 

for all 0 < 6 < 65 , by (l40l) . 

Using the fact that — a \,..., — a n are negatively associated and that —at and at have supports over 
intervals of equal length for all i = 1,2,... ,n, (l59l ) holds with £4 replaced by —£4. Thus, by 


(60) 


Eexp ( < ^ ex P ) for 0< 6 < 65. 


Using m we see that 0 < 4 AT A 2 < 6 2 , and as k < y, by (l54l ) and ( l60l ). and recalling that 
jj = b 2 — T7 2 , we have 


Eexp (4K 2 X 2 k{y^ — T] 2 ) 2 ) 


< ^exp f^-K 2 ?i 2 k(b 2 - ri 2 ) 2 ) < 3exp f^K 2 A 2 n(b 2 - r / 2 ) 2 


For the third expectation in (l52l) . again by (f4J~b . 0 < 2A < d 2 . Hence by (l46l) . 
Eexp(2AT) < exp (2 + ———— + 4c 2 A 2 /?(& 2 — T7 2 ) 2 


Applying bounds (l53l) . (I6TI) and (l62l) in (1521) . and setting 
3KV 2 a 2 32K 2 X 2 n{b 2 - r ] 2 ) 2 


Qn — 1 + 

we obtain 


■ + ■ 


4 ci A 2 a 2 


and Q 2 = 3+ ~~ 1 + 4c 2 X 2 n(b 2 — rj 2 ) 2 , 


( 61 ) 


(62) 


Eexp(AT L + Ar) <3 1/4 exp fciogk+^Qn + ^Qi 


^ , (3ATi + 8ci)A 2 n 2 (8AT 2 + 18c 2 ), 2 /, 2 

< 2exp ( Clogk + 2 + -—1-f 2 —— -— X 2 n(b 2 -r } 2 ) 2 
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Again by (|4TT) . 3Ki + 8 ci = AK\ and 8^2 + I 8 C 2 = 9Ki. Since k < 2n/3, we have 
logk = log n — log (n/k') < log n — log (3/2). 

Thus, using from (ITTI) that Clog(3/2) = log4 + 2, 

Eexp(AT L + XT) < 2exp ^Clogn — Clog(3/2) + 2 + a -\-K 2 ?i 2 n(b 2 - r/ 2 ) 2 

= ^exp ^Clogn+^—^—+ K 2 ?i 2 n(b 2 — T] 2 ) 2 ^ . 

In like manner we obtain this same bound on Eexp(A7« + XT), so (BTI) . now yields 

exp(A max |W/ — 17 Z,|) < exp (c\ogn + + KiX 2 n(b 2 — T] 2 ) 2 

1 <i<n y n 

This step completes the induction, and the proof. □ 

Proof of Theorem 17.41 Let A be the set of the r distinct values {a \,..., a, } and let £], £ 2 : , £ n be 
exchangeable random variables taking values in A. Let 

n 

M = (Mi,M r ) where for j = 1,..., r we set Mj = Y 1 (£,■ = aj), 

1=1 

the number of components of the multiset £ = {£ 1 ,..., £„} that take on the value aj. With C denoting 
distribution, or law, clearly 

£(£!,£ 2 ,...,£„) = Y E(£i,£ 2 ,...,£„|M = m)7 3 (M = m) 

m>0 

where m = (, m\,...,m r ) and m > 0 is to be interpreted componentwise. As M is a symmetric 
function of £ 1 ,£ 2 ,... ,£„, the conditional law C(e\,e 2 , ■ • • ,£„|M = m) inherits exchangeability from 
£(£i,£ 2 ,...,£„), that is, 

C,(E\iE2i ■ ■ • i E n |M m) ^ C,(Eji^ , E x (2) > • ■ • > ^n(n) ®) 

where n is uniformly chosen from V„. In particular, given M = m, 

k k 

Y £ ‘ =d Y e M‘) for all k = 1 ,..., n 

i= 1 i=l 

where = f ; denotes equality in distribution. Hence, (l39l ) of Theorem 13. 1 1 yields the version of the hrst 
claim of Theorem 11.41 when conditioning on M, and taking expectation over M yields that result. 

We now demonstrate the second claim under the assumption that 0 0 A, which together with A 
finite implies that 



v = min|n| (63) 

aeA 

is positive. With this value of v the constants ci,C 2 and d 2 as given by Theorem 12.21 depend only 
on A, and let C,K\ ,K 2 and Aq be as given in (1441 for this v. As y > v, conditional on £ 1 ,... ,£,„ 
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inequality (l39l ) of Theorem [371] holds for r 7 = 7 , and the argument is completed by taking expectation 
over M as for the proof of the first claim. 

For the last claim, under the hypotheses that E \,..., £ n are i.i.d. mean zero random variables, since 
K\ depends only on A, by Lemma [276] there exists A > 0 depending only on A such that 


E 


( K { X 2 Sl 


< 2 . 


V n 

Thus from the second claim of the theorem we obtain 


Eexp(A max \Wk — \fnyB k i n \) < 2 exp (Clog n), 

0 <k<n 


and applying Markov’s inequality yields 


P 


max 

VO <k<n 


\W k - VnyB k / n \ > Z 'Clogn + x^ < 


Eexp(A max 0 < k < n \W k - y/nyB k / n \) _ Xx 
exp (Clog n) 


2 exp (Clog n) __ Xx Xx 

~~ exp(Clog n) 


□ 


4 Proof of Theorem 11.3 

In this final section we prove Theorem 1 1.31 by first demonstrating a ‘finite n version’ of the desired 
result in the following lemma. 

Lemma 4.1. There exists a constant A such that for every finite set A of real numbers not containing 
zero, there exists a constant A > 0 such that for any positive integer n, any £, £| , £„ i.i.d. 

random variables with mean zero and variance one satisfying Ee 3 = 0 and taking values in A, and 
S/ ( = Ya= 1 = 1 it is possible to construct a version of the sequence (Sk )()<k<n kind Gaussian 

random variables (Zk)o<k<n with mean zero and Cov(Z,-,Z 7 ) = i A j on the same probability space 
such that 


Eexp(A|S„ — Z„|) <A (64) 

and 


Eexp(A max |S* — Z&|) < Aexp(Alogn). (65) 

0 <k<n 

Proof As in Theorem 13.11 it suffices to prove the result with the maximum taken over 1 < k < n. 
Recall the positive constant 0\ from Theorem 12.11 the values from Lemma 12761 B from (fT71) . and 

let C, K\ , K 2 and Aq be as in Theorem [L4] for v = min ne ^ |a|. Set 


A = min 


01 Ao B t(e) B t(e 2 ) 1 | 

T’T y/2. ’B+l J ' 


( 66 ) 


Let g n (s) and h" (z) denote the mass function of S„ and the density of Z„ respectively; in particular 
h n (z) is just the Af(0,n) density. By Theorem 12.11 as 2A < 0\ , with S” the support of S n , there is a 
joint probability function xi/ n (s,z) on S " x M such that 

[ V"(s,z)dz = g n (s), Y, V"( s >z) =/l "(z)> (67) 

7r seS „ 
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and 


Y exp(2X\s-z\)v"{s,z) 


seS" 


dz < 8 . 


( 68 ) 


Given any multiset of values e = {ei,... ,£„} from A, let pg(s,z) be the joint density function 
guaranteed by Theorem 13.11 from that result, the marginal distributions of s and z are, respectively, 
/”(s) as in (l37l) . and 0''(z), that of a mean zero Gaussian vector with covariance (l38l) . 

For any s € S'\ define 


B n (s) = {{ £ i,£ 2 ,•••,£«}: Y e ‘ = 4- 

i=t 


Now, recalling the definition (l36l) of 6 s , for s € 5”, s such that £ s € z € R and z 6 M”, let 


7"Cfz,s,z) = yr"(s,z)F(£ = £ s | S„ = s)p£ s ( s,z), 


(69) 


where the multiset £ on the right hand side is composed of n independent random variables distributed 
as e. Interpreting (l69l) in terms of a construction, to obtain (S,Z, S,Z) one first samples the joint values 
S and Z of the coupled random walk and Gaussian path at time /i, then conditional on the terminal 
value S, one samples increments £ consistent with the path s from their i.i.d. distribution, and finally 
one couples a walk S to the discrete Brownian bridge Z in such a way that a certain multiple of it and 
(Wi,...,W n ) given by 


Wi = Si--S n 
n 


are close. 

To verify that (l69l) determines a probability function, recalling (l35l ). note first that 


(70) 


£ P(£ = £ s |S„ = s)pg S (s,z) 

s:e s eB"(i) 

= L £ P(£ = 5|5„ = 5)p^(s,z)= £ P{e = 8\S n = s) £ Ps(s,z) 

SeB"(s) seA n s SeB" (s) s<=A% 

= Y, P{£ = S\S n =s)(l) n (z) = (l)"(z). 
SeB"(s) 

Now by (l67l) . 

Y L r"(^,z,s,z) = (7i) 

s€S" s:e s eB n (s) 

and integrating over z and z yields 1 . 

Let (S,Z, S,Z) be a random vector sampled from y n (s,z, s,z), and define Z = (Z\,... ,Z n ) by 

Zj = Z; H-Z. 

n 

Using that Z and Z are independent by (fTil . and that the latter has covariance given by (l38l) . it follows 
that Z is a mean zero Gaussian random vector with Cov(Z,-,Zy) = i A j. 
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Regarding the marginals of s, integrating ( l69l ) over z and z, with f£(s) given by (fTTI) . we obtain 

[ [ f(s,z,s,z)dzdi = g n (s)P(£ = £ s | S„ = s)ft( s) = P(e = f s )/£(s) = P(e = e s ) 

JR" JR |Ag S | 

The first term is the likelihood that the independently generated increments corresponding to those 
of s, while the second term is the chance that these increments will be arranged by the uniform 
permutation in an order that produces s. Hence, the marginal correspond to the distribution of S. 

It only remains to show that the pair (S,Z) satisfies the bounds (l64l) and (l65l) . Note that for 
1 < i < n, recalling (l70l) . we have 



i 

(~ i \ 

Z;| = 

Wi + -S- Z; H Z 

n 

\ n J 


< \Wi-Zi\ + -\S-Z\. 


(72) 


From (l69l) . one can easily check that the conditional distribution of (S,Z) given (£ S .Z) = (b,z) is 


Pe(s,z). 

Let y 2 = n 1 YH=\ £ f an d recall v = min ae _4 \a\ > 0. As 7 > v and 4A < Ao by 
invoke Theorem 13.11 conditional on {£,Z}, and choosing r] = 7 we obtain 

\6K { X 2 Sl 

n 


E(exp(4A max | Wj - tZ 2 |)If ,Z) < exp ( Clog/r + 

l<i<n ' 


with C and K\ depending only on A. Applying the Cauchy-Schwarz inequality and 
are measurable with respect to {f,Z}, from (l72l) we obtain 


< 


Eexp(A max IS; —Z;|) 

1 <i<n 

E("E(exp(A max \W{ — Z;|)|£,Z)') 2 Eexp(2A|S — Z|) 

V \<i<n / 


11/2 


< 


8 E^E(exp(A max \Wi — Z;|)|f,Z)^ 


2i 1/2 


we may 

(73) 
as S and Z 


(74) 


Using conditional Jensen’s inequality, the triangle inequality and the convexity of the exponential 
function in the first three lines below, (1731) yields 


E(exp(A max |W; — Z 2 [)If,Z) 

V 1 <i<n 

<E(exp(2A max \Wi — Z;|)|fi,Z) 


< ^-E(exp(4A max |Wj- —yZ;|)|f,Z) + ^-E(exp(4A max 1 7 Z; — Z;|)|fi,Z) 

] 2 c2' 


1 / 16^iA 2 S 2 \ 1 , , ,, , ,~ IX| , 

< -exp ( Clog 7 iH-j + -E(exp( 4 A| 7 — 1| max |Z;|)|fi,Z) 


n 


< exp(Clog 7 i) + -E(exp(4A|7— 1| max |Z,|)If ,Z). 

2 1 <i<n 


(75) 
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< 2 , 


For the first term in the fourth line, Lemma [2T6l yields 


Eexp 




since £1 has mean zero, |£i| < B in (fT7T) and A\fK[X < f^( e ) by (1661 . 

For the second term in ( |75T ). observe that conditional on (e,Z), Z is a mean zero multivariate 
Gaussian random vector with covariance given by (138k Equivalently, conditional on (g,Z), the distri¬ 
bution of (Zj / yjn)\<i< n is that of a Brownian bridge on [0,1] sampled at times l/«,2/w,..., 1. Thus, 
letting B u t € [0,1] be a Brownian bridge independent of (e,Z), since y is a function of £, we have 

E(exp(4A|y — 1| max |Z;|)|g,Z) 

1 <i<n 

=E(exp(4\/nA|y— ll max ^1)1 e,Z) 

\<i<n sfn ' ’ 

=E(exp(4 v / nA|y— 1| max |fi f |)|g,Z) 

te[n]/n 

<E(exp(4\/nA|y— 1| max^ |fi f |)|g,Z) 

<E(exp(4v / nA|y— ll max B t ) + exp(4\/nA|y— 11 max (— B t ))\e,Z). 

v 0<r< 1 0<f<l 1 ’ 

From Il25l . the distribution of X = max 0 < f <i B t is given by 

P(X < x) = 1 — exp(— 2x 2 ) for x > 0. 


Using this identity, and the fact that — B, is also a Brownian bridge, it is straightforward to show that 
for any real number a, we have 


E(exp(n max B,) +exp(n max 
v 0<?<i o<r<i 


(—B t ))) < 2 + v / 27roexp(a 2 /8). 


Thus, since B t and y are respectively independent of, and a function of, £, we obtain 

E(exp(4A|y— 1| max |Z,-|)|e,Z) 

<2 + v / 27r4\/nA|y— l|exp (2A 2 «(y— l) 2 ) 

<2 + 4(B+l)\/27mAexp(2A 2 n(y 2 -l) 2 ) (76) 

where in the last step, we used | y — 11 < B + 1 where B is given by (fTTT ). and that y > 0 implies 
l<(y+l) 2 . 


Since Egj 2 = 1, we have «(y 2 — l) 2 = (LLi( e ? — Eg 2 )) - /' 1 an< i E(g 2 — Eg 2 ) = 0. As g 2 < B 2 and 
0 < s/lX, < by (l66l) . Lemma [T6l yields 

Eexp (2A 2 n(y 2 — l) 2 ) < 2. 

Additionally, since A(B+ 1) < 1 by (l66l ). taking expectation in (l76l) yields 

E(exp(4A|y— 1| max |Z,-|)) = 2 + 8(B+ l)y/2nnZ < exp(Cilogn) (77) 

1 <i<n 


30 





for some universal constant C\. 

Thus, by (l74l ). (1751) and (1771) . we have 


E exp (A max | .S', — Z,j) 

\<i<n 


< 


8E^exp (Clog/?) + -E(exp(4A|y— 1| max |Z,-|)|e,Z) 


1 1/2 


< 8'/2 
<A cxp(4 log/?) 


exp(Clogn) + -exp(Ci log 7?) 


1/2 


for some universal constant A, which we may take to be at least 8. The proof of ([(77! ) is now complete. 
Lastly note that Z n = 0 implies Z n = Z, hence (1681) yields (l64l ) as A > 8. □ 

Theorem II .31 follows from Lemma 14. 1 1 in exactly the same way as Theorem 1.5 follows from 
Lemma 5.1 in (61, noting that the reasoning applied at this step does not depend on the support of the 
summand variables of the random walk. 
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